Model Selection

ONNX Format

# ONNX Format

Qwen3 1.7B ONNX

Qwen3-1.7B is a 1.7B-parameter open-source large language model released by Alibaba Cloud, based on the Transformer architecture, supporting various natural language processing tasks.

Large Language Model

Stt Ru Fastconformer Hybrid Large Pc Onnx

NVIDIA FastConformer-Hybrid Large is a Russian automatic speech recognition model based on the FastConformer architecture, supporting CTC and RNN-T decoders.

Speech Recognition

Grounding Dino Tiny ONNX

A lightweight zero-shot object detection model in ONNX format, compatible with Transformers.js, suitable for browser-side deployment.

Object Detection

Granite Timeseries Patchtst

IBM Granite series time series forecasting model, based on PatchTST architecture, suitable for various time series forecasting tasks.

Mediapipe Selfie Segmentation Landscape

A lightweight portrait segmentation model in ONNX format, specifically optimized for separating people from backgrounds in landscape images.

Image Segmentation

Timesformer Hr Finetuned K600

TimeSformer-HR is a video action recognition model optimized for high-resolution videos and fine-tuned on the Kinetics-600 dataset.

Video Processing

Timesformer Base Finetuned Ssv2

TimeSformer is a Transformer-based video understanding model specifically optimized for temporal action recognition tasks.

Video Processing

Timesformer Base Finetuned K600

TimeSformer is a video understanding model based on the Transformer architecture, specifically designed for video classification tasks.

Video Processing

Timesformer Base Finetuned K400

TimeSformer is a Transformer-based video understanding model, specifically fine-tuned on the Kinetics-400 dataset.

Video Processing

Whisper Base.en

Whisper is a general-purpose speech recognition model trained by OpenAI. This model is based on large-scale weakly supervised training and supports speech transcription in multiple languages.

Speech Recognition

Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting multilingual speech transcription.

Speech Recognition

4x APISR GRL GAN Generator Onnx

GAN-based 4x super-resolution image upscaling model, compatible with Transformers.js

Image Enhancement

Gyr66 Bert Base Chinese Finetuned Ner Onnx

This is the ONNX format conversion version of the gyr66/bert-base-chinese-finetuned-ner model, designed for Chinese named entity recognition tasks.

Sequence Labeling

Transformers Chinese

Depth Anything Large Hf

ONNX version of depth estimation model based on Transformers.js, suitable for web applications

Fmops Distilbert Prompt Injection Onnx

This is the ONNX format conversion of the fmops/distilbert-prompt-injection model, designed for detecting prompt injection attacks.

Large Language Model

Transformers English

Bert Base NER Onnx

This is the ONNX format version of the dslim/bert-base-NER model for named entity recognition tasks, capable of identifying four entity types: location, organization, person, and miscellaneous.

Sequence Labeling

Transformers Supports Multiple Languages

Dpt Hybrid Midas

Hybrid depth estimation model developed by Intel, combining the advantages of convolutional neural networks and Transformer architecture

Swin2sr Lightweight X2 64

Lightweight Swin2SR image super-resolution model that can upscale image resolution by 2 times

Image Enhancement

Swin2sr Classical Sr X2 64

A classical image super-resolution model based on Swin2SR architecture, capable of upscaling image resolution by 2 times

Image Enhancement

Trocr Base Handwritten

A Transformer-based handwritten text recognition model that converts handwritten images into text

Trocr Small Handwritten

A small Transformer-based handwritten text recognition model optimized for web usage

Text Recognition

Trocr Base Printed

TrOCR is a Transformer-based OCR model specifically designed for recognizing printed text.

Text Recognition

Trocr Small Printed

TrOCR-small-printed is a compact optical character recognition (OCR) model specifically designed for printed text recognition.

Text Recognition

Gbert Large Paraphrase Cosine Onnx

A German text embedding model based on sentence-transformers, mapping text to a 1024-dimensional vector space, specifically designed to enhance few-shot text classification performance in German

Transformers German

blackcodetavern

Distilbart Cnn 12 6

DistilBART-CNN-12-6 is a distilled version of the BART model, optimized for text summarization tasks, with a smaller size while maintaining high performance.

Text Generation

YOLOS-small is a small object detection model based on the Transformer architecture, designed for efficient visual tasks.

Object Detection

YOLOS-tiny is a lightweight object detection model based on the Transformer architecture, suitable for real-time object detection tasks.

Object Detection

Tinystories 1M ONNX

TinyStories-1M-ONNX is a small language model based on the ONNX format, suitable for text generation tasks.

Large Language Model

Transformers English

E5-small-v2 is an efficient text embedding model suitable for various natural language processing tasks.

Wav2vec2 Base 960h

ONNX format conversion of Facebook's wav2vec2-base-960h model, designed for Transformers.js, supporting browser-side speech recognition

Speech Recognition

MMS-LID-4017 is a speech recognition model supporting 4017 languages, developed by Facebook, focusing on language identification tasks.

Text Classification

MMS-LID-126 is a multilingual speech recognition model released by Facebook, supporting recognition of 126 languages.

Text Classification

Ast Finetuned Speech Commands V2

A voice command recognition model based on AST architecture, optimized for web deployment in ONNX format

Audio Classification

Ast Finetuned Audioset 10 10 0.4593

Audio Spectrogram Transformer (AST) model fine-tuned on the AudioSet dataset for audio classification tasks

Audio Classification

Whisper Medium is a medium-scale speech recognition model developed by OpenAI, supporting automatic speech recognition (ASR) tasks in multiple languages.

Speech Recognition

Detr Resnet 101

End-to-end object detection model based on Transformer architecture with ResNet-101 feature extractor

Object Detection

Whisper Small is a small automatic speech recognition (ASR) model developed by OpenAI, capable of converting speech into text.

Speech Recognition

Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting speech-to-text tasks in multiple languages.

Speech Recognition

Whisper Tiny is a lightweight speech recognition model open-sourced by OpenAI, suitable for web deployment.

Speech Recognition

A large-scale text summarization model based on the BART architecture, optimized for the CNN/DailyMail dataset

Text Generation

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase